Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions
نویسندگان
چکیده
We target the problem of closed-loop learning of control policies that map visual percepts to continuous actions. Our algorithm, called Reinforcement Learning of Joint Classes (RLJC), adaptively discretizes the joint space of visual percepts and continuous actions. In a sequence of attempts to remove perceptual aliasing, it incrementally builds a decision tree that applies tests either in the input perceptual space or in the output action space. The leaves of such a decision tree induce a piecewise constant, optimal state-action value function, which is computed through a reinforcement learning algorithm that uses the tree as a function approximator. The optimal policy is then derived by selecting the action that, given a percept, leads to the leaf that maximizes the value function. Our approach is quite general and applies also to learning mappings from continuous percepts to continuous actions. A simulated visual navigation problem illustrates the applicability of RLJC.
منابع مشابه
Task-space Control of Electrically Driven Robots
Actuators of robot operate in the joint-space while the end-effect or of robot is controlled in the task-space. Therefore, designing a control system for a robotic system in the task-space requires the jacobian matrix information for transforming joint-space to task-space, which suffers from uncertainties. This paper deals with the robust task-space control of electrically driven robot manipula...
متن کاملDiscrete time robust control of robot manipulators in the task space using adaptive fuzzy estimator
This paper presents a discrete-time robust control for electrically driven robot manipulators in the task space. A novel discrete-time model-free control law is proposed by employing an adaptive fuzzy estimator for the compensation of the uncertainty including model uncertainty, external disturbances and discretization error. Parameters of the fuzzy estimator are adapted to minimize the estimat...
متن کاملClosed-Loop Learning of Visual Control Policies
In this dissertation, I introduce a general, flexible framework for learning direct mappings from images to actions in an agent that interacts with its surrounding environment. This work is motivated by the paradigm of purposive vision. The original contributions consist in the design of reinforcement learning algorithms that are applicable to visual spaces. Inspired by the paradigm of local-ap...
متن کاملRobust Control of Electrically Driven Robots in the Task Space
In this paper, a task-space controller for electrically driven robot manipulators is developed using a robust control algorithm. The controller is designed using voltage control strategy. Based on the nominal model of the robotic arm, the desired signals for motor currents are calculated and then the voltage control law is proposed based on the current errors and motor nominal electrical model....
متن کاملRobust Control of Electrically Driven Robots in the Task Space
In this paper, a task-space controller for electrically driven robot manipulators is developed using a robust control algorithm. The controller is designed using voltage control strategy. Based on the nominal model of the robotic arm, the desired signals for motor currents are calculated and then the voltage control law is proposed based on the current errors and motor nominal electrical model....
متن کامل